Similarity Computations with an OAI-PMH Aggregator
نویسندگان
چکیده
The proliferation of the Open Archive Initiative Protocol for Metadata Harvesting (OAI-PMH) has resulted in the creation of a large number of service providers, all harvesting from either data providers or aggregators. If data were available regarding the similarity of metadata records, service providers could track redundant records across harvests from multiple sources as well as provide additional end-user services. Due to the large number of metadata formats and the diverse mapping strategies employed by data providers, similarity calculation requirements necessitate the use of information retrieval strategies. We describe an OAI-PMH aggregator implementation that uses the optional “” container to re-export the results of similarity calculations. Metadata records (3751) were harvested from a NASA data provider and similarities for the records were computed. The results were useful for detecting duplicates, similarities and metadata errors.
منابع مشابه
Initial Experiences Re-Exporting Duplicate and Similarity Computation with an OAI-PMH aggregator
The proliferation of the Open Archive Initiative Protocol for Metadata Harvesting (OAI-PMH) has resulted in the creation of a large number of service providers, all harvesting from either data providers or aggregators. If data were available regarding the similarity of metadata records, service providers could track redundant records across harvests from multiple sources as well as provide addi...
متن کاملREPOX - A Framework for Metadata Interchange
This demonstration presents an XML framework for metadata interchange. REPOX has two goals: to be a means for libraries and other cultural institutions to provide OAI-PMH access to their metadata records, independently of their original format, with a tool that is easy to install, use and deploy; and to be used as an aggregator of OAI-PMH Data Sources. The records are stored internally in XML a...
متن کاملCollecting metadata from institutional repositories
The purpose of this article is to review metadata issues identified in recent research carried out in Scotland on services based on metadata aggregation via OAI-PMH, and to examine the role of collection-level description in managing ingest to harvested repositories, subsequent harvesting by secondary aggregators, and the contextualisation of institutional and aggregated repositories in the wid...
متن کاملInterweaving OAI-PMH data sources with the linked data cloud
The Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) has found wide-spread adoption for exchanging bibliographic metadata. In parallel, the W3C’s Linked Data Initiative exposes and interlinks structured data from a variety of data sources on the Web. Since many of these data sources contain valuable information for institutional repositories (e.g., shared concept definitions,...
متن کاملMetadata Harvesting with R and OAI-PMH
The Open Archives Initiative (http://www.openarchives.org/) develops and promotes interoperability standards that aim to facilitate the efficient dissemination of content. One key project is the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH, http: //www.openarchives.org/pmh/) which provides “a low-barrier mechanism for repository interoperability” for archives (institutiona...
متن کامل